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Abstract 

It's the age-old recurrence with a twist: sum the last two terms and if the result 
is composite, divide by its smallest prime divisor to get the next terni (e.g., 0, 1, 1, 
2, 3, 5, 4, 3, 7, ... ). These sequences exhibit pseudo-random behaviour and generally 
terminate in a handful of cycles, properties reminiscent of 3x + 1 and related sequences. 
We examine the elementary properties of these 'subprime' Fibonacci sequences. 

1 Introduction 

When John Conway last visited the first author, he passed the time on the piane by calcu- 
lating what we now cali subprime Fibonacci sequences. They are just the sort of thing 
Martin Gardner would have featured in his column. There is some risk of their becoming 
as notorious as the 3x + 1 problem (if a number is odd, triple it and add one; if even, 
halve it), with which they seem to have something in common, and of which Erdòs has said, 
"Mathematics is not yet ready for such problems." 

Start with the Fibonacci sequence 0, 1, 1, 2, 3, 5, ... , but before you write down a 
composite term, divide it by its least prime factor so that this next term is not 8, but rather 
8/2 = 4. After that the sum gives us 5 + 4 = 9, but we write 9/3 = 3, then 4 + 3 = 7 which 
is okay since it is prime, then 3 + 7 = 10 but we write 10/2 = 5, and so on: 

1 1 2 3 5 4 3 7 5 6 11 17 14 31 15 23 19 
21 20 41 61 51 56 107 163 135 149 142 97 239 168 37 41 39 40 
79 17 48 13 61 37 49 43 46 89 45 67 56 41 97 69 83 76 
53 43 48 13 61 37 ... 

and we are in a cycle of 18 terms. If we start with 1, 1 or 1, 2 we clearly get the same result. 
But we may start with any pair of numbers, and you may like to try starting with 2, 1, or 
1, 3, or 3, 9, or 13, 11, or ... . 

One might suspect that every such sequence enters this 18-cycle, just as it is conjectured 
that every 3x + 1 sequence enters the 3-cycle 1, 4, 2, 1, 4, 2, 1, .... After ali, since our 
sequences are bounded or unbounded they must either end in a cycle or increase indefinitely. 
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We do not believe the latter happens, and though we have no proof we provide a heuristic 
argument in §3J But is the 18-cycle the only 'non-trivial' cycle? Wait and see. 

First, note that a, a, where a 7^ ±1 gives the sequence a, a, a, a, ... . This is a trivial 
cycle. Sequences that end in trivial cycles are trivial sequences; e.g., 5, 15, 10, 5, 5, 5, 
. . . , or —143, 39, —52, —13, —13, —13, .... If two consecutive terms have the sanie sign 
then so do ali subsequent terms. If they are of opposite sign or include a zero, then they 
bound further terms until two consecutive terms of the same sign appear, e.g., —17, 7, —5, 
2, —3, —1, —2, . . . , after which the sign remains Constant. 

Next, note that two terms of opposite parity are followed by an odd term, and that two 
odd terms may be followed by an even or an odd term, depending on whether their sum is 
a multiple of 4 or an odd multiple of 2. Finally, one can have an arbitrarily long string of 
even terms, but it must terminate since the power of two in consecutive terms must decrease 
steadily, e.g., 128, 160, 144, 152, 148, 150, 149, . . . , and once we have an odd term (unless 
this is a trivial sequence) , the even terms are isolated with each followed by at least two odd 
terms. Therefore, we need only concern ourselves with sequences of positive terms, comprised 
of these 'runs ' of odd terms separated by even terms. 

Finally, let the shape of a sequence be the string of its terms' parities (O for odd, E for 
even). The Fibonacci sequence has shape EOO EOO EOO EOO . . . . Our sample subprime 
Fibonacci sequence has shape EOOEOOEOOOEOO ... ; the 'extra' odd term here arises 
when 2 is a composite's smallest prime factor such that dividing by it gives an odd number. 
In the example, starting at 13, 61 inclusive gives the shape OOOOOEOOOEOOOOEOOE 
that repeats with the cycle (though for cycles, the starting point doesn't matter; 
OOOEOOOOEOOEOOOOOE is just as valid as a shape for the cycle). 

2 Nodes and other cycles 

To get a 'sense' for the originai sequence, one could plot its trajectories on a directed graph. 
This approach is quite favored for the 3x + 1 problem [2] , but because of the recurrence we 
must be purposeful in defining vertices for this sequence. We introduce two important terms: 

• The nodes of a sequence are the ordered pairs of two positive coprirne odd integers 
not preceded by an odd term. 

• Runs are the strings of odd terms and a terminating even generated by a node, such 
that each node corresponds to the first two terms of a run. 

In §3] we will see that at some point a non-trivial sequence becomes composed of runs. Here 
is the same sequence with nodes parenthesized: 

(1, 1) 2 (3,5) 4 (3, 7) 5 6 (11, 17) 14 (31,15) 23 19 
21 20 (41, 61) 51 56 (107, 163) 135 149 142 (97, 239) 168 (37, 41) 39 40 
(79, 17) 48 (13, 61) 37 49 43 46 (89, 45) 67 56 (41, 97) 69 83 76 
(53,43) 48 (13,61) 37 ... 

As already discussed, we can treat each substring of O . . . OE as a unit, starting when 
the first two terms of such a substring are coprirne and are not preceded by an odd. The 
corresponding terms comprise a run, and the first two terms of the run comprise a node. Let 
us now construct our first sequence graph (Figure [[]). For notational convenience we weight 
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the digraph by assigning to each are the length of the run corresponding to the node at the 
arc's tail. 




Figure 1: Digraph generated by the 0,1 sequence 



One reason a digraph is a useful representation is that many nodes cari be direct prede- 
cessors to a single node. If a node is a predecessor (not necessarily direct) to another node 
or cycle, we say it is tributary to the node or cycle. Some examples are shown in Figure [2J 




Figure 2: Some tributaries to the 18-cycle 



Can we enumerate a node's direct predecessors? Yes, but with some work. For example, 
take the node (89, 45) of the 18-cycle. The preceding even term t must satisfy t + 89 = 45g, 
where q is 1 or 3 (q = 2 makes t odd, prime q > 3 would follow 89 with 15q instead), which 
gives t = —44 or 46, so the node must always be proceeded by 46. Let the odd term (by 
definition) before t be s. Then s + 1 = 89p, where p is 1 or an odd prime < 89. For t = 46, 
possible values of s are 43, 221, 399, etc. 

The term before s must also be odd. If this term is r, it must satisfy r + s = 2t since 
r + s is even. For example, s = 43 gives r = 49. In fact, none of the other possible values 
of s work since they would make r < 0. In fact, we can keep assuming each prior step 
used a division of two, which gives —83, 109, 13, 61, 37, 49, 43, 46. Thus (109,13), 
(13,61), (61,37), (37,49), and (49,43) are exactly the direct predecessors. This process for 
constructively generating the graph has thus far discouraged a graph theory approach to 
analysis. 

Do sequences ali finish up in the 18-cycle we have already seen, similar to how we suspect 
ali 3x + 1 sequences end in 1, 4, 2, 1, 4, 2, 1, ... ? Let us start at the node (151, 227): 
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(151, 227) 189 208 (397, 121) 259 190 (449, 213) 331 272 (201, 43) 122 (55, 59) 57 58 
(23, 27) 25 26 (17, 43) 30 (73,103) 88 (191,93) 142 (47,63) 55 59 57 58 
(23, 27) ... 

and we are in a 19-cycle whose first repeated node is (23,27). Note that though 55, 59 are 
the first two repeated terms, they only act as a node the first time through; thus (47, 63) 
being a node with the terms 55, 59 in its run does not preclude (55, 59) from being a node 
in another context. Both nodes would then be predecessors to the node (23,27). 




Figure 3: Digraphs of the six known non-trivial cycles 



If you start with 5, 13 you will enter a 136-cycle through node (47, 23) (though simpler 
starting terms like 1, 4 will suffice). If you start with 5, 23 you will enter a 56-cycle through 
node (119, 109) with 5693 as its largest term. The four cycles we have seen so far were found 
by hand, and it was not until we used a computer that we found two shorter cycles: an 
11-cycle generated by the node (37,199) and a 10-cycle generated by the node (127,509). 
We checked sequences that start with two numbers 1,000,000 or below and found no other 
non-trivial cycles. 

In Table[T]the headings indicate the range for the first two terms of the sequence and the 
values are how often each cycle occurs. The distribution of non-trivial cycles is quite stable 
with regard to the starting conditions, suggesting that the number of predecessors to a node 
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is fairly arbitrary and not determined by the cycle it is tributary to, such that the number 
of starting conditions tributary to a non-trivial cycle increases at a roughly equal rate for 
each cycle. However, trivial cycles decrease in proportion since a cycle a,a . . . requires ali 
earlier terms to be multiples of a. Applying the 'direct predecessor' method shows why this 
is, and how this makes relatively few starting conditions lead to a given trivial cycle. 



Cycle length 


1.. 10,1. .10 


1..10 2 ,1..10 2 


1..10 3 , 1..10 3 


1..10 4 , 1..10 4 


1..10 5 , 1..10 5 


1 


14 


348 


10022 


320531 


11588563 


10 








33 


6310 


668764 


11 








390 


34520 


3479974 


18 


63 


4837 


467014 


46985673 


4709133000 


19 





249 


30490 


3090886 


307710709 


56 





188 


21990 


2238493 


224936180 


136 


23 


4378 


470061 


47323587 


4742482810 



Table 1: Distribution of cycles based on ranges for first two terms 



It should be clear why non-trivial sequences exhibit such pseudo-random behavior, in 
their terms and their digraphs, as their definition involves prime factorization. However, 
another source of apparent randomness seems to be the conditional iteration itself, as with 
3x + 1 sequences. For example, if one considers a variant of subprime Fibonacci where 
only division by 5 occurs (when the sum is divisible by 5), similar observations arise. This 
makes certain questions seem intractable, specifically those regarding the growth and end 
conditions of a sequence given its starting conditions. 

3 End conditions 

A sequence must either end in a trivial cycle, a non-trivial cycle, or increase indefinitely. 
These end conditions are of interest; however, it seems more likely here than in the 3x + 1 
problem that sequences do not increase indefinitely. Here is an informai argument that 
supports such a conjecture: 

The Fibonacci sequence has shape EOO EOO EOO EOO . . . , but for subprime sequences, 
when we add two odds there is an even chance that after dividing by 2 the result is odd, 
a quarter of a chance the next one is odd as well, and so on. We would expect a typical 
sequence to have three times as many odds as evens, so that in twelve consecutive terms 
we would divide by 2 six times on average and that among the other six steps the odd sum 
is divisible by 3 around one-third of the time. We expect to have divided by 2 6 • 3 2 = 576 
on average, while the usuai Fibonacci growth would have been only by a factor 12 « 322, 
where = (1 + v / 5)/2 is the golden ratio (the limit of the ratios of two successive Fibonacci 
terms). With this argument we have not considered larger primes than 3 or the rarity of 
primes among large numbers. 

Considering that starting terms 5, 23 produce a term as large as 5693, the difficulty 
of proving the non-existence of divergent sequences seems comparable in difficulty to the 
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analogous problem for the 3x + 1 sequences. However, this does not prevent the deduction 
of results like the following: 

Proposition 1. A non-trivial sequence contains infinitely many primes (not necessarily 
distinti), each greater than their two preceding terms. 

Proof. If after some point the sequence does not contain primes, then at each step a division 
happens and so the maximum value of any two consecutive terms decreases over time. Since 
this value cannot decrease forever, we get a contradiction. (This proposition is stronger than 
only asserting infinitely many primes, as a prime could be generated after the division by a 
prime factor.) □ 

Proposition 2. After some point consecutive terms of a non-trivial sequence are always 
coprirne. 

Proof. The greatest common divisor of two consecutive terms of the sequence cannot in- 
crease. Indeed, gcd(a, b) = gcd(ò, a + b), and so gcd(6, — ) < gcd(a, b). A non-trivial 
sequence must include a prime larger than its two preceding terms. Thereafter the GCD of 
any two consecutive terms is 1. □ 

Corollary 3. If the two starting terms of a sequence are coprirne, the sequence is non-trivial. 

These results justify our earlier definitions of node and run, as they ensure that only 
non-trivial sequences produce digraphs and that digraphs are unique representations (since 
a run uniquely leads into another run with no intervening terms, as evens cannot appear 
consecutively once nodes come into play). 

Another consequence of Proposition [2] and Corollary [3] is a simplification of the search 
for non-trivial cycles. Since ali starting conditions a, b with gcd(a, b) > 1 become trivial 
or reach gcd(a, b) = 1 for consecutive terms, it is suflicient to study starting conditions 
with gcd(a, b) = 1 to enumerate ali non-trivial end conditions. This, combined with the 
memorization of nodes and a lookup table for primes could give a more efficient search 
method than testing ali positive integer ordered pairs. 



4 The general system 

We devote the rest of our paper to the cycles that non-trivial sequences generate. By the 
definition of a run, we can conclude that a non-trivial cycle must consist of a concatenation 
of runs. It follows from Proposition [2] that any two consecutive terms in a non-trivial cycle 
are coprirne. 

When we build a subprime Fibonacci sequence we add two numbers first then divide by 
a prime number or 1. Let us correspond to each term of a sequence or a cycle the smallest 
prime divisor (or 1) by which the sum of the two prior terms was divided. These divisors are 
the signature. For example, the 10-cycle 127, 509, 318, 827, 229, 528, 757, 257, 507, 382 
has signature 7, 1, 2, 1, 5, 2, 1, 5, 2, 2; the initial 7 is the divisor to get 127, after adding 
the preceding cycle terms 507, 382. 

It is possible that signature terms are not bounded, as the 11-cycle has signature 29, 
3, 2, 1, 3, 2, 2, 1, 1, 2, 2 (since one of the intermediate sums is 29 x 37 = 1073). Runs 
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consist of consecutive averages, so given a run within a cycle, only its node (first two terms) 
has signature values not equal to 2. See this in action by noting the shape of the 10-cycle 
(OOEOOEOOOE) and comparing with the signature given above. Using Proposition [TJ 
this result on signatures follows: 

Corollary 4. Let a, b, c be consecutive terms of a sequence of a cycle. Then c > max(a, b) 
if and only if c = a + b, i.e., the corresponding signature value of c must be 1 (and c is thus 
prime). Thus, the largest term of a cycle must be prime with a signature value of 1. 

With the terms and the signature of a cycle, we can establish a homogeneous linear 
system. Let ti, . . . , t m be the terms of the cycle and s 1; . . . , s m be the corresponding signature. 
Then . . . , t m -i + t m = Siti, t m + ti = s 2 t 2 , h + t 2 = s 3 t 3 , tj_ 2 + = SiU, .... In 
matrix form, 



Si 





• 





-1 


-l" 




ti 


-1 


s 2 


• 








-1 




t 2 


-1 


-1 


S3 • 













h 








• 












t m -2 








• 


• -1 


S-m—l 







t-m—1 








• 


■ -1 


-1 


Sm 




tm 



We can now relate signatures to cycles and begin restricting potential cycles: 
Theorem 5. No two cycles have the same signature. 

Proof. This is equivalent to showing that a potential signature Si,...,s m defines at most 
one cycle. Given a potential signature, consider solutions for ti, . . . ,t m over the reals. We 
have a system of m linear homogeneous equations in m variables. In this particular set of 
equations, ali of the variables are expressible through exactly two consecutive ones, so the 
space of real solutions is at most 2- dimensionai. Given consecutive terms ti, t i+ i and positive 
signature values, the equations must reduce to At { + Bt i+ i = ti and Cti + Dt i+ i = t i+ i for 
some positive A, B, C, D. Thus ti is expressible through and the solution space is at 
most 1-dimensional. 

If the solution is 1-dimensional, let one of the terms equal 1. The terms are in Constant 
rational proportion to each other, so we can scale ali the terms until the smallest set of 
integer solutions is produced. The largest term may be prime; this solution is potentially 
a cycle. Further scaling cannot produce another cycle since the largest term would not be 
prime (Corollary H]). □ 

Theorem 6. There are no non-trivial cycles of one run (i.e., one even term). 

Proof. Let (ti, t 2 ) be the node of the run. Sum ali the equations of (0Q) to get 2{ti + - ■ ■+t m ) = 
• ■+s m t m . By definition, s 3 , . . . , s m = 2, so the equation becomes 2(ti+t 2 ) = siti+s 2 t 2 . 
Let ti be the largest prime with s x = 1. Then ti = (s 2 — 2)t 2 . Since t 2 > 1 (dividing a 
composite number by its smallest prime factor will never produce 1) and ti is prime, s 2 = 3 
and ti = t 2 , which is a contradiction since this is a non-trivial cycle. The argument is the 
same if t 2 is the largest prime. □ 
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Since each run has at least 3 terms: 

Corollary 7. There are no non-trìvìal cycles of length below 6. If a cycle of length 6 exists, 
its shape must be OOEOOE . 

The 'trick' of Proposition [6] does not generalize to helping find cycles of more than one 
run. In this regard, we instead look to Theorem [5J because it means that results on signatures 
are necessarily results on cycles, which makes it desirable to relate signature terms to one 
another in a meaningful way A signature is only useful if it produces a 1-dimensional solution 
space, requiring a determinant of 0. 

One such relation is to find the general expression for the determinant of an m-cycle in 
terms of si, . . . , s m , which we leave as an exercise to the reader. Our issue with this approach 
is that it ignores the run-based structure of cycles, and so we present a method where the 
only signature terms of interest are those corresponding to the node of each run (divisors 
not equal to 2). 

5 The run-centric system 

We begin by relating a run's terms with its node: 

Theorem 8. Given a node (a, b) where a, b > are odd, lei b = a + 2 k ~ 2 d with odd (but not 
necessarily positive) d > —a/2 k ~ 2 . The corresponding run is then {a + 2 fc_l Jj„i<i} where i 
goes from 1 to k and J n is the n-th Jacobsthal number. The run has length k, consisting of 
k — 1 > 2 odd terms followed by a single even term a + Jt-id. 

Proof. We justify the exponent k — 2 in 6 = a + 2 k ~ 2 d as it counts the number of divisions 
by 2, which occur for ali terms but the two node terms. Thus k denotes run length. The 
Jacobsthal numbers are defined by the recurrence J n = J n -i + 2 J n _ 2 , where Jq = 0, J\ = 1 
(see A001045I in OEISpQ). The next few are J 2 = 1, followed by 3, 5, 11, 21, ... . Solving 
the recurrence gives J n = |(2™ — (—1)™), and so apart from J they are ali odd. 
We conclude that the first k — 1 members 

a + 2 k ~ 1 J Q d = a, a + 2 k ~ 2 J x d = a + 2 k ~ 2 d, a + 2 l J k ^ 2 d (2) 

are ali odd since a and d are odd by definition, while the last (fc-th) term a + 2°Jk_id is even. 
By the recurrence, each term after the first two is the average of the two previous ones, a 
consequence of the definition of a subprime Fibonacci sequence. The condition d > —a/2 k ~ 2 
ensures ali our terms are positive. □ 

Corollary 9. The terms of the run defined by (a, b) are bounded by a, b, and terms after the 
node are bounded by b, In general, two consecutive run terms bound the rest of the run. 

We refer to Theorem [8] for a more run-based system for a cycle. Write out two runs in 
the style of 

ai, di + 2 kl 2 di, «i + 2 kl 3 g?i, . . . , ai + 2 t /fc 1 _2<ii, cti + Jki-\d\ 
a 2 , a 2 + 2 k2 ~ 2 d 2 , a 2 + 2 k2 ~ 3 d 2 , a 2 + 2J k2 _ 2 d 2 , a 2 + J k2 -\d 2 
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Concatenate the two runs. The two terms after the first run will be the first two terms 
(the node) of the second run. Remembering that J n = J n _i + 2J n _ 2 , we can express these 
terms as 

2oi + J kl di (p 2 + 2)ai + (Jjn-m + Jfcjji 

respectively, where p 2 , q 2 are the least prime divisors of the node of the second run. We 
will reserve the use of 'divisors' to the signature terms of nodes. 

When we bend the two runs into a cycle, the divisors of the first run will be denoted 
by pi and q±. Like with ([!]) we fix the length m of the cycle, which is done by fixing the 
individuai run lengths ki, k 2 . We now have the four equations 



P2«2 

fci-2. 



p 2 g 2 (a 2 + 2 kl - 2 d 2 ) 
Piai 

PiOi(ai + 2 fc2 - 2 di) 



2a x + Jkidi 

(p 2 + 2)ai + (J fcl _ip 2 + Jfejdi 
2a 2 + J k2 d 2 

(pi + 2)a 2 + (Jfc 2 _iPi + Jfc 2 )d 2 



giving four linear homogeneous equations in the four unknowns ai, di, a 2 , d 2 . Subtracting 
the first and third equations from the second and fourth then removing a factor from each 
gives 



2 




-P2 





1 


Jkx-l 


-92 + 1 


-2 fc2 - 2 g 2 


-Pi 





2 


Jk 2 


qi + 1 


-2 fel - 2 gi 


1 


Jk 2 -1 



0. 



(3) 



When writing out this expression or expressions for more runs, since entries of qi appear 
twice there will be 2 n terms for each possible product of pjS and g^s (excluding the product 
of ali Pi, qì) where n is the number of a;. For example, there are 2 terms of gì, 2 other terms 
of gip 2 , and 2 2 = 4 terms of gig 2 . Fortunately, the identity 2 k ~ l — J k = J k _i lets us combine 
these multiple terms into one in each instance. In the n = 2 case, the 4 terms involving gig 2 
simplify to one term: 



d fel J fc2 gig 2 + 2 fc2 1 J fcl g 1 g 2 + 2 Kl L J kl q x q 2 -2 



fci-i 



)fci+A;2-2 



— — (2 fcl 1 — JfcJ (2 fc2 1 — J fc2 ) gig 2 — —J kl -iJk 2 -iQiQ2- 
Using the identity means ([3]) reduces to: 

2 fcl+fc2 ~ 4 pi<?iP2<?2 = d fcl _iJ fc2 _iPip 2 + J fcl d fc2 _ 2 pig 2 + J fel - 2 Jfc 2 giP2 + d fcl _iJ fc2 _igig 2 

+ Jk 1 Jk 2 -lPl + ^fci-ldfegi + Jkx-lJk 2 P2 + dfe 1 J k2 -iq 2 
+ J^Jk 2 — {— l) kl+k2 4 . 
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Applying similar manipulation for 3-run cycles gives: 



J}i X -\Jk2-\Jk z -\V\ViV-A + Jkx-iJkzJks^PiPzqz + Jfc 1 Jjfc 2 -2^jfc 3 -iPi92P3 + Jk 1 Jk 2 -iJk 3 -2Piq2q3+ 

Jki-2Jk 2 -lJk 3 QlP2P3 + Jk 1 -2Jk 2 Jk [i -l ( llP2 ( l3 + Jk 1 -lJk 2 -2Jk 3 QlQ2P3 + Jk 1 -lJk 2 -lJk 3 -lQlQ2Q3 + 
Jki-lJk 2 -lJk 3 P2P3 + Jk 1 -lJk 2 Jk 3 -lP2(l3 + J k x J k 2 -2J k 3 <Ì2P3 + Jk 1 Jk 2 -lJk 3 -l(l2<l3 + 
Jk 1 Jk 2 -lJk 3 -lP3Pl + Jk 1 -\Jk 2 -lJk 3 P3<ll + J^J k 2 J k 3 -2<l3Pl + Jk 1 ~lJk 2 Jk 3 -lQ3 ( ll + 
L ■ L L T k 2 -lJk 3 -lPlQ2 + Jk x -2'h 2 Jk 3 q\P2 + Jkt-lJk 



Jk L -lJk 2 Jk 3 -lPlP2 + Jk 1 Jk 2 -lJk 3 -lPl<l2 + Jk 1 -2Jk 2 Jk 3 1lP2 + Jk 1 -lJk 2 -lJk 3 Ql<l2 + 

Jk 1 Jk 2 Jk 3 -lPl + Jk x ~\Jk 2 Jk 3 P2 + Jk 1 Jk 2 -lJk 3 P3+ 

J k x -\J k 2 J k 3 (l\ + J k x 3 k 2 -\J k 3 <Ì2 + ^fci ^fc 2 <^fc 3 -l93 + 
7 7 7 /I \fcl+fc2+fc3-6 

JkxJk 2 Jk 3 ~ \ — -LJ 



Is there a method to the madness? Yes. Let's move back the subscripts of p cyclically 
(pi — )■ p 3 , P2 — ► Pi; P3 — ^ P2) to see the pattern. Here are some terms after this shifting: 

' ' ' + Jk!~2Jk 2 Jk 3 -l(llPl(l2 + Jki-\Jk 2 -2Jk 3 <ll<l2P2 + Jk l -lJk 2 ~lJk 3 -l c ll<l2<l3 J r 

Jk 1 -lJk 2 -\Jk 3 P\P2 + ^1-1^2^3-1^193 + Jk 1 Jk 2 -2Jk 3 <l2P2 + Jk 1 Jk 2 -lJk 3 -l<l2<l3 + - - " • 

The subscripts of the Jacobsthal numbers are now or fcj- 2, corresponding to 0, 

1 or 2 occurrences of the subscript i on the shifted and in the term. Thus, for unshifted 
p this holds for Pi-i, q%, where po represents p n . 

We can now generalize of this expression for n-run cycles. Here is the determinant for 
3-run cycles (with originai pì): 



2 


Jk! 


-P2 











1 


Jki-1 


-q 2 + 1 


-2 fc2 ~ 2 g 2 














2 


Jk 2 


~P3 











1 


Jk 2 -l 


-93 + 1 




-Pi 











2 


Jk 3 


9i + l 


_ 2 fci-2 gi 








1 


Jk 3 -l 



Q3 



(4) 



We can see that in the 2n x 2n generalization of (Jl|) there are just 7n nonzero entries, 
three in each odd-numbered row, four in each even-numbered row. Every two rows repeat, 
but with the items shifted right two columns (for the last two rows, we 'wrap around' back 
to the first two columns) and the relevant subscripts incremented. 

Notice that the Pi occur only in the odd-numbered columns (or rows), whereas the qi 
occur in ali the columns (i.e., twice in the even-numbered rows). In the expansion there is 
just one term of degree 2n in p i: it is 

ofci+A^H \-k n —2n 

1 P\P2 Pn9l<?2 Qn- 

This is the only term that takes the product of consecutive divisors, e.g., P191, £>292- To 
see why, without loss of generality suppose we have a term with the product p2<?2- Let the 
matrix in (jl]) be (o^). Then the product ai 3 a 2 4 = (— p 2 )( — 2 fc2 ~ 2 g 2 ) was taken. The term 
taken from the next row is necessarily a 35 = — p 3 , and so on until we get the above term. 
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The remaining terms contain the product 

rn Sl n 52 . . . n Sn n in+1 n Sn+2 n &2n 
Pi P2 Pn HI H2 Hn 

where the Si are or 1, with their nonzero values ranging over ali subsets of {1, 2, ... , 2n} 
with cardinality less than or equal to n. A further restriction is that Si and Si +n cannot both 
be 1 for i < n since terms with are excluded. 

With the relationship between the Jacobsthal and divisor subscripts, and letting p refer 
to p n , the equation for n-run cycles may be written as 

n n 

n 2£<*- 2 > ft _ lft = e n ^-wti^ - (~i)^- 2) (5) 

»=1 5i,...,<5 2 „G{0,l}, »=1 

J2Sj<n, 
Vj<n: 8j+S J+n <2 

where the absolute terni Sj = 0) arises from 2 n products of n Jacobsthal numbers with 
n 2s or ls, which combine into a single term (— l)E( fc *~ 2 ) in the same fashion as already 
illustrated, together with the product —Jk 1 Jk 2 ' ' ' Jk n through the +1 s in the + 1 entries. 
Note that the equation holds for n > 2. 

The input for this formula only requires the number of runs in the cycle and the associated 
run confìguration, which is the n-tuple of run lengths and thus a concise version of the 
shape. For example, the run confìguration of the 10-cycle of shape OOEOOEOOOE is 
(ki, &2, ^3) = (3, 3, 4), where m = ^ fcj = 10. It is important to note that as with the shapes 
of cycles, run configurations (3,4,3) and (4,3,3) are identical to (3,3,4) since cycles have 
no definitive starting nodes; what matters is that the order of run lengths is preserved. 

6 Signature restrictions 

Let the terms of an arbitrary cycle be ai, h, ai + 3bl , . . . , a n , b n , s^iI^l^ a "+ 3fe " ; . . . where 

(ai, bi) are the cycle's nodes. Let the respective signature be pi, q±, 2, 2, ... , p n , q n , 2, 2, . . . 
età, where Pi corresponds to Oj and qi corresponds to 6j. Each p i: qi is either 1 or an odd 
prime (the cycle's divisors). As already established, n > 2. 

We were able to disprove one-run cycles and relate the divisors of n-run cycles to one 
another given the run confìguration. But even with a given two-run confìguration, we are left 
with a relation in 4 unknown variables pi, qi,p2, q2- Can we further restrict these variables? 
We already know that at least one of pi, q\, . . . ,p n , q n is 1, and that p x , q%, . . . , p n , q n cannot 
ali equal 1 since cycle terms cannot increase indefinitely. 

Can we strengthen these results? To motivate another approach, consider term vs. index 
for the 10-cycle (Figure HJ), which is composed of three runs: 
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Figure 4: 10-cycle; vertical dashes define the cycle, horizontal dashes illustrate run bounds 

The bounds are a consequence of CorollaryO Remembering Corollary|4]and that if some 
Pi or qi 7^ 1 then the sum of the two terms preceding the corresponding Oj or hi was divided 
by at least 3, we provide three stronger results: 

Proposition 10. At least two ofpi, q±, . . . ,p n , q n do not equal 1. 

Proof. Without loss of generality, let ali but P2 or q 2 be 1. 

Case 1. p 2 7^ 1. Then &i and the rest of the first run are > ai, so 02 > and 6 2 > «i- 
Since ali other p^, qi = 1, ali terms before ai are > ^a±. Then ai > ai, a contradiction. 

Case 2. g 2 7^ 1- Then b\ and the rest of the first run are > ai, so a 2 > 2ai and ò 2 > 0, 
and a2 + b2 > ai. Since ali other Pi,qi = 1, ali terms before ai are > -ai. Then ai > ai, a 
contradiction. □ 

Proposition 11. At least two ofp±, qi, ■ ■ ■ ,p n , Qn equal 1. 

Proof. Without loss of generality, let exactly one of pi, q± be 1 (the rest are > 3). Then the 
corresponding term ai or b\ is prime and the largest term of the cycle. 

Case 1. pi = 1. Then b\ < |ai and the rest of the run is < -ai- Then a 2 < |ai, 
ò 2 < Hai, an d the rest of the second run is < ^ai- Since ali further p iy qi (if any) are also 
> 3, they do not increase the maximum. We iterate our bounding to get 61 < |§§ai < |ai 
and the rest of the run is < |ai. Then a 2 < §fai, 6 2 < ff ai, an d ^ ne res ^ °^ ^ ne terms before 
ai are < ^a x < |oi. Then ai < ai, a contradiction. 

Case 2. qi = 1. The first run is < 61 except for b\ itself. Then a 2 < |òi, 6 2 < |òi, 
and the rest of the second run is < y|òi. Since ali further pi,qi (if any) are also > 3, they 
do not increase the maximum. Then ai < ì^b\ < \b\. We iterate our bounding to see that 
ai ^ bl < |òi and that the rest of the run is < Then a 2 < jrj&i, b 2 < ff^i, and the rest of 
the terms before ai are < Then ai < ^61 and b\ < |||oi < &i, a contradiction. □ 

Proposition 12. If there are only two pi, qi, . . . ,p n , q n that equal 1, the two cannot be of 
the forra pi, qi unless q i+x ) = (3,3), (3,5), or (5,3). 

Proof. Without loss of generality, let pi,qi = 1 and ali further Pi,qi > 3. Then b\ is the 
largest term, with b\ < 2a±, ai + bl < |ai, and the rest of the run is < |ai. Then a 2 < ^ai, 

ò 2 < 7 ^ 2+1 ^ ai. Since ali further Pi,qi (if any) are > 3 then ali subsequent terms before 
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ai are < max y^ a i, 4^T a iJ • Hence ai < max y^ai, Q ij? a contradiction unless 
(P2,?2) = (3,3), (3,5), or' (5, 3). q 

These results are particularly restrictive ori cycles of only two nodes, that is, where the 
only divisors are Pi, <?i,p 2 , <? 2 . One might conjecture that 

Conjecture 13. There are no non-trivial cycles of two runs (Le., two even terms). 

Because of these results, ali that is needed to prove this conjecture is a similar argument 
against the cases where one of pi,gi and one of p 2 ,g 2 are 1, and eliminating the three 
exceptions of Proposition [T2J However, consider the cycle signature 7, 1, 2, 1, 5, 2, 2. Using 
these values for si, . . . , s n in the earlier system and scaling as in Theorem [5] gives the cycle 
candidate 13,51,32,83,23,53,38, which would work if 51 were prime. Thus to prove that 
other 'cycles' like this similarly fail, the primality test for an unknown set of numbers may 
be required. 

However, it also seems possible that with a requirement of exactly two runs, primes in 
a signature and terms in a cycle are bounded in some way After ali, longer instances of 
such a cycle only means that runs take longer to terminate, but since runs are recurrences of 
averages, the cycle's two nodes' positions relative to each other should be fairly restricted. 



7 Cycles of a given length 

Regardless of whether the preceding argument can be formalized and generalized to cycles 
of any number of runs, it is stili important that the cases involving cycles of shorter lengths 
are exhausted. How can we do this? Consider what we know: 

• Relationships between signature terms and between divisors (([T]) and (jSJ)) 

• Each signature corresponds to a unique potential cycle (Theorem [5]) 

• Non-existence of one-run cycles (Theorem [6]) 

• Restrictions on possible signatures (Propositions [10] to [T2|) 

These give a way to determine whether two-run cycles of a given length exist, which for cycle 
lengths of 6 to 8 exhaust ali possible cycles of that length: 

Theorem 14. There are no 6-cycles. 

Proof. A 6-cycle must have shape OOEOOE and therefore a signature pi, q±, 2,p 2 , q2, 2. 
Using either ([3]) with run configuration (ki,k 2 ) = (3,3) or the more general ([!]) for n = 6, 

we get 

4pigip 2 <?2 = P1P2 + <?i<?2 + 3(pig 2 + q\P2 + Pi + ?i + P2 + 92) + 8. 

By our previous results, exactly two of pi, qi,P2, q2 must be 1. There are only four cases: 
Case 1. pi,q\ = 1 (equivalent to p 2 ,q2 = 1)- Then 4p 2 q 2 = 7(p 2 + q2 + 2). Since we 

only want solutions over the odd primes, then exactly one of P2 and g 2 is 7 and (p2, (fe) = 

(3, 7), (7, 3). These solutions fail by Proposition [T2l 

Case 2. p±,p 2 = 1. Then qiq 2 = 2qi + 2q 2 + 5 =3- (qi — 2)(g 2 — 2) = 9. The solutions over 

the odd primes are (qx, q 2 ) = (3, 11), (5, 5), (11, 3), though reordering the runs shows (3, 11) 

and (11,3) are equivalent. 
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Case 3. q 1 ,q 2 = 1. Then p x p 2 = 2p 1 + 2p 2 + 5 =^> (pi,p 2 ) = (3, 11), (5,5), (11,3), though 
reordering the runs shows (3, 11) and (11,3) are equivalente. 

Case 4. pi, q 2 = 1 (equivalent to p 2 , q\ = 1). Then qip 2 = Aq\ +4p 2 + 17 =>■ (q x —A){p 2 — 
4) = 33. The solutions over the odd primes are (qi,P2) = (5,37), (37,5). 

Substitute the signature values into (CQ) and solve the system. Since the solution space is 
1- dimensionai, we can express ali the cycle terms ti, . . . ,t n in terms of ti (even better: let 
ti = 1), and then multiply by the common denominator to get the unique cycle candidate: 



ÌPi,qi,P2,q2) 


t 2 /ti 


h/h 


U/ti 




te/ti 


Cycle candidate 


It should be. . . 


(1,3,1,11) 


3/5 


4/5 


7/5 


1/5 


4/5 


5,3,4,7,1,4 


4,7,11, not 4,7,1 


(1,5,1,5) 


1/3 


2/3 


1 


1/3 


2/3 


3,1,2,3,1,2 


2,3,5, not 2,3,1 


(3,1,11,1) 


19/9 


14/9 


1/3 


17/9 


10/9 


9,19,14,3,17, 10 


19,14,11, not 19,14,3 


(5,1,5,1) 


3 


2 


1 


3 


2 


1,3,2,1,3,2 


3,2,5, not 3,2,1 


(1,5,37,1) 


11/41 


26/41 


1/41 


27/41 


14/41 


41,11,26,1,27,14 


11,26,37, not 11,26,1 


(1,37,5,1) 


1/27 


14/27 


1/9 


17/27 


10/27 


27,1,14,3,17, 10 


10,27,37, not 10,27,1 



Table 2: Candidates for a 6-cycle 



The last entry also fails because 1, 14 should be followed by 5, not 3, and because the 
largest term is not prime. Since ali candidates fail, the theorem is proved. □ 

The problem is that the linear system takes divisibility into account, but not divisibility by 
the smallest prime factor, or no division if a sum is already prime. Note that the symmetries 
above do not always occur; here they arise from both the runs being of shape OO E. 

There are also two lemmas that can simplify things: 

Lemma 15. The smallest term in a non-trivial cycle must be a node term (and thus odd), 
and at least 7. 

Proof. Since node terms bound a run's terms, the smallest number of the cycle must also be 
one of its node terms, which are odd by definition. 

Dividing a composite number by its smallest prime factor never produces 1. If 3 is 
the smallest cycle term, the previous members a, b must add to 3, 6 or 9. Since the same 
integer cannot be separated by only one term (the sequence a, b, a continues into trivial cycle 
a, a,...), it follows that the smallest number in the cycle is less than the two preceding 
members. Hence, a and b are greater than 3, giving the two cases (a, b) = (5, 4), (4, 5) which 
are tributary to, but not part of, non-trivial cycles. 

If 5 occurs in a sequence, the previous members a, b must add to 5, 10, 15, or 25, and 
if they are to be greater than 5, (a, b) = (6, 9), (7, 8), (8, 7), (9, 6), (6, 19), . . . (19, 6) (this list 
would be deduced by the first half of the 'direct predecessor' method of §2]). Calculation 
shows these are tributary to, but not part of, non-trivial cycles. 

7 is the smallest member of the 136-cycle, completing the proof. □ 

Note that this result immediately disqualifies ali the 6-cycle candidates. Also, using lower 
bound arguments omitted here, we can eliminate the three exceptions of Proposition [12] for 
ali two-run cycles: 
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Lemma 16. In a two-run cycle, p\,qi cannot both be 1 andp 2 ,q2 cannot both be 1. 

Thus for cycles of longer length, we can apply Theorem[T2fs method of generating candi- 
dates and easily show why they fail. For 7-cycles, the only possible run configurar, ion is (3, 4); 
for 8-cycles, the two possible configurations are (3,5) and (4,4). At the time of writing, we 
have confirmed that: 

Theorem 17. There are no cycles of length 8 or less. There are no two-run cycles of length 
11 or less. 

8 Open questions 

1. Give an algorithm for finding solutions {pi, q{\ to (jSJ) given a run configuration (k\, . . . , k n ), 
at least for n = 3. 

2. Determine if and how divisors and terms are bounded based on the number of runs. 

3. Find more non-trivial cycles, or prove that no more exist. 

4. Prove that subprime Fibonacci sequences do not increase indefinitely. 

5. Explore the subprime variants of other linear recurrences. 
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